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Abstract 

In this note we describe a simple and intriguing observation: the quantum Fourier transform 
(QFT) over Zq, which is considered the most "quantum" part of Shor's algorithm, can in fact 
be simulated efficiently by classical computers. 

More precisely, we observe that the QFT can be performed by a circuit of poly-logarithmic 
path- width, if the circuit is allowed to apply not only unitary gates but also general linear gates. 
Recalling the results of Markov and Shi [9] and Jozsa [7] which provided classical simulations 
of such circuits in time exponential in the tree-width, this implies the result stated in the title. 

Classical simulations of the FFT are of course meaningless when applied to classical input 
strings on which their result is already known; Our observation might be interesting only in the 
context in which the QFT is used as a subroutine and applied to more interesting superpositions. 
We discuss the reasons why this idea seems to fail to provide an efficient classical simulation of 
the entire factoring algorithm. 

In the course of proving our observation, we provide two alternative proofs of the results of 
[9j [7] which we use. One proof is very similar in spirit to that of f9] but is more visual, and is 
based on a graph parameter which we call the "bubble width" , tightly related to the path- and 
tree-width. The other proof is based on connections to the Jones polynomial; It is very short, 
if one is willing to rely on several known results. 



1 Introduction 

In our attempts to understand and characterize the quantum computational power, it is interesting 
to understand which parts of quantum computation are truly quantum, and which can be simulated 
efficiently by classical computers. This has been the subject of many works over the past few years, 
e.g., the Gottesman-Knill theorem [10], providing a simulation of quantum circuits that use only 
Clifford group gates, the simulations by Vidal of quantum circuits that use only limited amount of 
entanglement |13] , and the efficient simulation of circuits using only "match gates" [14:\ [T2] . 

*This result was presented in informal discussions in QIP 2006, Paris. 
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Of particular interest in our context is the recent work of Markov and Shi [9] who considered 
quantum circuits restricted not in the type of gates they use, but rather in the topology of the graph 
corresponding to the quantum circuit (the graph whose nodes corresponds to the quantum gates, 
and whose edges correspond to the wires in the circuit). They show that quantum circuits can be 
simulated classically in time polynomial in the number of gates and exponential in a topological 
parameter called the tree- width of the circuit graph. In [9j, Markov and Shi raised the question of 
whether the Quantum Fourier transform can be assigned small tree-width quantum circuits, which 
would imply its efficient classical simulation by their theorem. 

In this note we observe that a simple generalization of the results of Markov and Shi [9j allows 
us to do this, namely, to show that the quantum Fourier transform can be simulated classically in 
polynomial time. To state our result precisely requires a little more detail which we provide now. 

Our approach begins by introducing a topological parameter of a graph called the bubble-width 
of the graph. It will turn out that the bubble width is closely related to the tree width but we find 
that the bubble width is a more visual parameter that is easier to work with. It is defined roughly 
as follows: Imagine the graph is embedded in in some way, and that a huge spheric bubble sits 
very far away from the graph. The bubble approaches and "eats" the nodes of the graph one by 
one until eventualy it has swallowed the entire graph which now sits inside it. We think of the 
edges of the graph, and of the surface of the bubble, as flexible objects, made of rubber, say, and 
so in the process of the swallowing, both the surface of the bubble and the nodes and edges of the 
graph can be moved, stretched, or bent, in a continuous manner. In topological language, we allow 
isotopies of the bubble and of the graph. The goal is to find a way for the bubble to swallow the 
graph, such that the number of edges of the graph that cross the surface of the bubble at any given 
point does not exceed a certain number c. The minimal number c for which such swallowing is 
possible is called the bubble-width of the graph. 

We shall consider a much more general class of circuits than quantum circuits which we call 
operator circuits. In such circuits, the gates operate on the n-fold tensor product of two dimensional 
vector spaces, the same space as the Hilbert space of n qubits. However, the gates which we allow 
are not necessarily unitary gates, or even quantum permissable gates, i.e., completely positive 
maps. In fact, we simply allow any linear transformation from k to £ qubits. Just like in the case 
of quantum circuits, there are n input bits (some of which might be constant) and there are m 
output bits, one of which is marked to be the answer of the computation. 

For an operator circuit we show the following: 

Theorem 1.1 Given an operator circuit Q, denote the graph associated with it by Gq. Let 
BW{Gq) be the bubble-width of this graph. Given an input string x, denote by Q{x)q the vec- 
tor that is the projection of Q applied to x onto the sub space that has the answer qubit 0. There 
exists a classical efficient algorithm that outputs the exact norm squared of Q{x)o; moreover, the 
time that the simulation takes is at most exponential in BW{Gq), and polynomial in the number 
of gates in Q. 

We note that following similar arguments to the proof of Theorem II. H we can actually also 
calculate the exact inner product of Qx with any output string y. 

Theorem 11.11 is essentially the result in [9] though that result is stated for tree- width instead of 
bubble width and is restricted to quantum circuits Q instead of operator circuits (the proof of [9] 
works for operator circuits, a fact undoubtably known by the authors). Alternate proofs of similar 
results to the above were given in [7|. Here we provide yet two more proofs of the above result. 
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The first, contained in section [5l is self-contained and gives a clear picture of the association of the 
bubble width with the computation of the circuit. The second, contained in section [HI only holds 
for case when Q is indeed a quantum circuit, and uses the intriguing connection between quantum 
circuits and the Jones polynomial [Ij. This proof is very short if one is willing to rely on results 
from those areas. 

We then turn to the quantum Fourier transform and show: 

Theorem 1.2 There exists an operator circuit which applies the quantum Fourier transform on n 
qubits to within precision 0{l/n) and whose bubble width is 0{log'^{n)). 

The design of the operator circuit is based on a result of Cleve and Watrous [5] who gave logarithmic 
depth (non-planar) quantum circuits for the Fourier transform. Their circuits were of linear bubble 
width, but using the relaxation from unitary quantum circuits to operator circuits, we can show 
how to make the bubble width polylogarithmic. 

The combination of these two theorem has an intriguing conclusion: the quantum Fourier 
transform has an efficient classical simulation. Of course, there is not much we can learn from 
applying the Fourier transform circuit on a classical input string, and studying the probability for 
some output; we already know that for any classical input string, the outcome will be distributed 
uniformly. The above statement is thus of little meaning in the context of classical inputs. The 
reason it might be of interest never the less is because of the hope to apply it to more interesting 
circuits, which may include the Fourier transform as a subroutine. The above statement shows 
that there is reason to believe that the Fourier transform part in the circuit will not be the obstacle 
towards classical simulations of such circuits. 

At this point the reader might wonder why this result does not imply that factoring can be 
performed classically, since it seems that the quantum Fourier transform is the only truly quantum 
part of Shor's algorithm, i.e, the only part that is hard to simulate classically. The problem is 
extending the result to the entire Shor's algorithm lies in the first part of Shor's algorithm, namely, 
the modular exponentiation, which seems like a "classical" part. Even though the circuit is classical, 
it is performed on a superposition of all strings, and so we cannot simply simulate it by a classical 
circuit of the same size. The problem in attempting to use our methods is that we would need to 
show how to perform the modular exponentiation so that the resulting circuit, and moreover, the 
combined circuit with the QFT circuit, has small bubble width. 

An interesting open question is to ask whether these results can be used in other contexts. One 
way that one might hope to use this is in order to estimate the Fourier coefficients of interesting 
quantum states; if a quantum state can be generated with a small bubble width circuit, and if the 
Fourier transform subroutine does not increase the bubble width significantly (as is the case for 
instance for states coming from log-depth planar circuits), then the Fourier coefficients of the state 
can be calculated efficiently classically. This might be a way to derive efficient classical algorithms 
for certain tasks, by first constructing a small bubble-width operator circuit for the task. 

Related work 

After completing this work, we have learned that similar results were achieved independently by 
Yoran and Short around the same time [16]. The results in [16] are in fact somewhat stronger, as 
they achieve not only quasi-polynomial simulation of the QFT but rather a polynomial simulation. 
The methods are different, and we believe there is independent merit for both results. 
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2 Graph Parameters: Bubble Width, Tree Width, Path Width 

2.1 Notation 

For a finite set S, \S\ will denote the number of elements of S. Given a finite graph G, we shall 
denote by v{G) the vertices of G and E{G) the edges of G. For a given graph G and vertex v, the 
star graph Gv shall be the subgraph of G consisting only of the edges and vertices in G connected 
to V. 

2.2 Bubble Width 

Definition 2.1 Bubble Width Given a graph, a bubbling B of G shall mean an ordering of all 
the vertices of G, 

61,62, ■■■,bn. 
This ordering induces a sequence of subsets 

Si G S2 G ■ ■ ■ G Sn 

with Si = {bi, . . . ,bj}. For each i, we define Zi{B) C E{G) to be the set of edges with exactly one 
endpoint in Si. The width of B shall be maxj |2:j(i?)|. The bubble width of G, denoted BW{G), is 
defined to be the minimal width over all bubblings of G. 

2.3 Tree-width, path width and the connection to Bubble-width 

We show that the parameter bubble-width is tightly related to the well studied notions [3j of 
tree-width and path width. 

Definition 2.2 Tree- Width, Path Width A tree decomposition of a graph G is an undirected 
tree T, where each node t ^ T is assigned a subset t of the nodes of G. The condition for this to be 
a tree- decomposition is 

1. For each edge {v,w) in G, there must exist a node t ^T whose subset contains both v and w. 

2. If V €z V{G) appears in two subsets ti,t2 € T, then v must appear in all subsets on the 
(unique) path between ti and t2- 

The width of the tree decomposition is the maximum over all nodes t inT of the number of nodes 
in the subset t. The tree-width of G, denoted by TW{G), is the minimal possible width of all 
tree- decompositions of G. 

The path width PW{G) is defined similarly, except that instead ofT being a tree, we constrain 
T to be a path (i.e. a tree with all nodes of degree at most 2). 

It is well known that 

Lemma 2.3 (Korach and Sold ;8j) Given a graph G ofn vertices, TW{G) < PW{G) < 0{log{n))TW{G) 
where n = \V{G)\. 

It turns out that the bubble width is tightly connected to the familiar notion of path- width. 

Lemma 2.4 Consider a graph G of n nodes, where each node has degree bounded by an overall 
constant (d). Then ^PW{G) < BW{G) < d ■ PW{G). 
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Proof: Suppose the bubble width of G is BW{G) is achieved by the bubbhng 61, 62, ■ ■ ■ Define 

ti = {v E G : an edge of Zi{B) is connected to v }, in other words tj consists of all vertices that 
are connected to edges that cross the boundary of the bubble at the ith step. It is straightforward 
to verify that the path of length n — 1 which has the subset ti associated to its ith vertex is a path 
decomposition; it is also clear that the path width for this decomposition is at most 2BW{G) and 
thus lPW{G) < BW{G). 

Given a path decomposition T with assigned subsets ii,t2 - ■ ■ create a bubbling of the vertices 
of G as follows: list, in any order, the vertices in ti, then list in any order those vertices in t2 that 
were not in ti, then list those vertices in that are not in t2 in any order, etc. Let bi,b2, ■ ■ - bn 
be this order of the vertices. We analyze the width of this bubbling. For each i, let ji be the 
index of the first ij^ for which 6, G For any edge (a, b) with a e Si = {bi, . . .bi} and b ^ Si, 
notice that it must be the case that a Eta for some a < ij and 6 G 4 for some b > ij. It follows 
from the conditions on path decompositions, that we must have a G tj.. Thus for every edge in 
Zi{B), at least one of the vertices is contained in fj.. It follows then that < d\tj.\ and thus 

BW{G) < dPW{G). m 

We can combine the above two lemmas to obtain the following statement: 

Lemma 2.5 The three parameters, bubble-width, path-width, and tree-width are equal up to poly- 
logarithmic factors. 

3 Labeled Graphs and Operator Circuits 

Definition 3.1 Given a finite graph G, an edge labeling I of G will be a map I : E{G) {0, 1}. 
If H is a subgraph of G, then a labeling of G induces a labeling of H , we will refer to this labeling 
of H by I as well. 

Let ^ be a two dimensional vector space with orthonormal basis vectors |0) and For a set 
of edges E of some graph, we shall let be the vector space of the tensor product of \E\ copies 
of A, one corresponding to each clement of E. For a labeling I of E, the notation a^^^^ shall mean 
the basis vector of A^^ corresponding to the tensoring together of the basis element \l{e)) in the 
component of A'^^ corresponding to the edge e. Thus the set of cc'^^^ as I ranges over all labelings 
of E is an orthonormal basis of A®^ . 

Definition 3.2 Given a finite graph G, for each vertex v E G, a tensor associated to v shall be a 
map ruy from the set of labelings of Gy to C. 

The tensor ruy induces many linear maps which we describe here. Let E = E{Gy) be the set of 
edges adjacent to v. Then determines a linear map m^'® : A®^ C given by the equation 

mf«(a'(^))=m„(0, 

for all labelings I of Gy. In addition, for any partition of E into two sets, E = E1WE2, 
determines a linear map m^^'^^ : — > A®^'^ implicitly determined by the equation that for all 
labelings Z of G^, 
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where l{Ei) is the labehng oi Ei d E induced by the label I of E. Finally, we define the map 
mS'-^ : C ^ A®^ given by 

i: Z is a label of 

Definition 3.3 Given a finite graph G, a tensor assignment m to G will be a specification of a 
tensor rriy to every vertex v of G. 

Definition 3.4 Tensor Circuit A Tensor circuit T = {G, M) shall he any graph G with a tensor 
assignment M . The value of the circuit, denoted T{G, M) shall he defined as follows: 

E n "^''(0- 

I : I is a lahel of G veV{G) 

4 From Operator circuits to Tensor Circuits 

We would like to associate with an operator circuit Q and an input string x, a tensor circuit Tq. 
We shall do this in two steps, first wc modify the operator circuit Q to a new operator circuit Q' , 
then we associate a tensor circuit to Q' . Given a linear gate g : A"^ A" going from niton qubits, 
we define the adjoint gate g* : A^ A™" that is determined by the following: for all x € A" and 
y £ vl™, {y\A*x) = {Ay\x). Given an operator circuit Q, define the operator circuit Q' as follows: 
first apply Q, then apply on the answer qubit the operator that projects onto |0), and finally apply 
the "adjoint" of Q, i.e. the circuit that is Q flipped upside down with each gate g replaced by the 
adjoint gate g* . We leave it to the reader to verify that the inner product between an output string 
X and Q' applied to an input string x is the norm squared of Q{x)q (recall Q{x)q denotes the vector 
that is the projection of Q applied to x onto the subspace that has the answer qubit 0). We now 
describe the tensor circuit Tq{G,M). The graph G shall be the graph associated with the circuit 
Q'. The tensor assignment M is as follows: 

• For a vertex v oi G corresponding to a linear gate g : A" A"^ of Q' , let Ei (respectively 
E2) be the edges in G corresponding to the n input qubits (respectively m output qubits) 
that meet at v. We assign to v the tensor for which the associated linear map m^^'^^ is 
the linear gate g. 

• For the vertex v of G corresponding to the gate that projects onto |0) in the answer qubit 
(which has degree 2) we define the tensor associated to v by 

m^(|0)|0)) = m„(|l)|l)) = 1, m,{\0)\l)) = m„(|l)|0)) = 0. 

• For the vertices v corresponding to the i^^ input or output qubit of Q' (which have degree 
1), we define the tensor associated to v by my{\xi)) = 1 and my{\xi © 1)) = 0. 

With this construction we have the following connection between the operator circuit Q and 
the tensor circuit Tq {G, M) : 

Lemma 4.1 The value of the tensor circuit Tq{G, M) defined above is the norm squared of Q{x)q. 

Proof: It is straightforward to verify that the value of the tensor circuit is the inner product of 
X with Q' applied to x. The result then follows from the observation made earlier that this latter 
inner product is equal to the norm squared of Q{x)q. ■ 
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5 Efficient Simulations of Operator Circuits of logarithmic Bubble- 
Width 



We want to prove Theorem 1 We start by moving from an operator circuit Q and an input vector 
X to its tensor circuit Tq{G,M) as in the previous section. We highlight the connection between 
the bubble width of G and that of the graph associated with Q: 

Lemma 5.1 Given Q and G as above, BW{G) < 2BW{Q) + 1. 

Proof: (Zeph: Dorit. . . I leave it to you to say something here) ■ 

The following theorem, when combined with lemmas 15.11 and 14.11 implies Theorem 11.11 

Theorem 5.2 Given a tensor circuit T{G, M), its value can he computed classically in time polyno- 
mial in \V{G)\2^^^^^ . In particular if BW{G) is logarithmic in \V{G)\ then the time is polynomial 
in the size of the graph. 

Proof: We will produce vectors ^l^i, I < i < with ipi G j[(Bizi{B) ^ fpj^g ^^^g^ vector, ipn (a 

scalar since Zn{B) is empty) is the value of the circuit. The map from ipi to tpi+i will be a linear 
map. Our result will then follow. 

The main idea here is the following. A tensor assigned to a vertex induces many linear maps; 
we choose the linear maps that minimize the number of computational steps. The choice will be 
determined by the best bubbling. Let 6i , . . . , 6„ be the bubbling of G which achieves the bubble 
width of G. Now let ijji = mf,'^^^^\l) € j\^zi(B) ^ (note that zi{B) consists of all the adjacent edges 
of vi). Given V'i-i £ -^^^g show how to compute tpi. Split the incident edges of Vi into two 

groups El and E2 where Ei is the set of edges that are in Zj_i(i?) and E2 is the set of edges in 
Zi{B). It follows that Ei ]J E2 is the set of all edges incident to Vi and Zi-i{B) — Ei = Zi{B) — E2- 
We now set -04 = "i^^''^^(^j_i), where ruv^'^^ : j^(^Zi{B) -g linear map that is the 

identity on yl®^.-i(s)-i?i = j^m,{B)-E2 Censor with the hnear map m^^'^^ : vl^^i ^ A'^^\ 

We leave it to the reader to verify that with these definitions, ipn ends up being the value of 
the tensor circuit T{G,M). 

The complexity of this algorithm is the sum of the complexities of the application of the linear 
maps that take ipi to V'i+i- There are |y(G)| such linear maps and the largest vector space encoun- 
tered is the tensor product of BW{G) copies of A and is thus of dimension 2-^^^'^). It follows that 
the complexity is polynomial in \V{G)\2^^^'^\ ■ 



6 Fourier Transform Circuit of logarithmic Bubble width 

We shall modify the construction by Cleve and Watrous ^ of the log-depth quantum circuits for 
Fourier transform to produce a circuit of poly-logarithmic bubble width. The modification takes 
advantage of the fact with the more general linear operator circuits, bits can be erased easily. In 
other words - the transformation 

|0),|1)^1 (1) 

where 1 is simply a scalar, is a valid transformation. 

We are interested in constructing an operator circuit that performs an approximation of the 
quantum Fourier transform. We begin with notation consistent with [5]. By \x) we shall mean 
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the basis state \x) = |x„_2) • • • \xo)- We define Ifie) = "^(|0) + e^'^'^ll)). Then the quantum 

Fourier transform is the hnear extension of the map 

1^:) \lpx) = |MO.xo)l/^O.a:ixo) • • • |M0.x„_i2,'„_i ...xq ) • 

We remark that as in [5], the state 

\lpx) = \fJ-0.xo)\ ■ ■ ■ I^O.Xfe_i • • •a;o)|/^O.Xfc-xi)|^O.Xfe+i • • •X2) • • • \f^0.x„-i-x„_k), 

where we replace each fj,Q by the approximation of by the first k digits after the decimal point, 
is a good approximation for \ipx) when k = 2log{n/e) + 0{1). Our construction will be of a circuit 
that applies the linear extension of the map |x) \il^x)- 

Our circuit will be composed of the product of three circuits applied sequencially: 

1 . The linear extension of the map defined by 

|x)-|a.) = |x^)|0'=)|xti)|0'=)...|xg)|0'=). 

2. The linear extension of the map defined by 

Wx) - 1/3.) = l4)|0'=-^)|^o..o)l4-i)|0'=-^)lmxi.o)---|4)|0'-^)lMo.x„....„_,). 

3. The linear extension of the map defined by 

\ax) \'>Px)- 

The first map is straightforward. We denote the map that makes one copy of a single qbit, i.e. 
the linear extension of the map defined by |0) |0)|0), |1) by the picture 




Ix ) Ix ) 

Then we can create k copies of each bit with a log k depth circuit by using 0{k) of these maps, as 
in the following picture: 



Now, we insert an extra bunch of k qubits in the state |0) to the right of each bunch of copied 
qubits, using the hnear operator 1 i — |0). 

The third map is also straightforward since we are using Unear circuits and we do not require 
unitarity of the gates. Notice that \^x) can be gotten by eUminating all bits except those in the 
2A:th, 4A:th, 6/cth etc location. Unlike in the unitary case, where a lot of effort was put into getting 
rid of the remaining so called computational bits, here we independently at each location apply the 
simple transformation which takes all those bits to the scalar 1. 

The more involved component is the second circuit. Following [5], the circuit below is the linear 
extension of the map: 

\x)\0'^) ^ \x)\0^'')\fSO.cc,.,.,....,.,+,), 

which can be implemented according to the following diagram: 



10) 



cut cm c> 



10) 



10) 



IO)ka,^....x^_J 



where the "H" gate is the Hadamard gate and the gates with one open and one closed circle are 
C-NOT gates. The gates with two open squares, though depicted as identical to each other, are 
different controlled-phase shift gates which we now describe. Define the controlled-phase shift map 
c — P{9) to be the map defined by \x)\y) — > e^'^^^^'^jx) Then the two open square gate in the 
above diagram that acts on \xi) is c — ^(2'"-^"^"^). 

Thus to implement the second map we apply in parallel the gates Aj,0<j<n — Ito the 
state \ax) in the following way: Aj acts on the strands of la^;) corresponding to the k + 1th copy 
of each \xi) it needs as well as those corresponding to the A; + 1 th block of jO'^). Thus the Aj act 
on disjoint sets of strands (for different j) and therefore they can be applied in parallel. We note 
that each Aj has "width" no bigger than 2k'^, i.e. the distance between any two strands that Aj 
acts on is no more than 2k'^. It should be clear that the application, in parallel in this way, of the 
gates Aj,0<j<n — 1 implements the map \ax) — > \(3x)- 

We have completed the description of the circuit that implements the approximation of the 
Fourier transform. It is left to upper bound the bubble width of the above operator circuit. To do 
this we describe a certain bubbling which will provide an upper bound on the bubble width. The 
bubbling is very simple: we bubble from left to right. The precise order does not matter as long as 
the bubbling swallows gates above and below things it has already swallowed before swallowing too 
many things farther to the right. The resulting width for this bubbling is no more than quadratic 
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in k (and thus by choice of k poly- logarithmic). The reason for this relies on two features of the 
circuit: a) the circuit has depth linear in k, and b) the "width" of any gate encountered is no more 
than quadratic in k. 

This completes the proof of Theorem ll.2i 

7 Remarks on why the simulation fails for Shor's algorithm 

It is natural to ask whether these techniques can be extended to provide an efficient classical 
simulation of Shor's algorithm. All our attempts to do so have failed, and it seems that there is 
an inherent difficulty here. The reason is that the modular exponentiation part in the algorithm 
requires multiplication, and to the best of our knowledge, the bubble width of multiplication circuits 
is close to linear. One might hope to try and avoid this problem by using simpler operations that 
would suffice for factoring. However, all our attempts to do so encountered yet another problem 
which seems difficult to handle: the bubble width is not additive. One can connect circuits of very 
small bubble width, to get a very large bubble width. Hence, not only that the different parts of 
the factoring circuit need to have small bubble width, but their connections need to be designed in 
such a way that the bubble width of the entire circuit is still small. 

8 Epilogue: The proof of Theorem 11.11 using the Jones polynomial 
technique 

Here we sketch an alternative, short proof of Theorem 1 1.11 in the case when the operators involved 
in the circuit are unitary. We assume familiarity with the notions of the Jones polynomial, braids, 
and the statements of the recent results in quantum computation regarding these notions [21 [1]. 
More background can be found in [2] and [T|. The proof is achieved by combining the quantum 
universality of the Jones polynomial [6l [1] with the well known fact that the Jones polynomial of a 
braid can be calculated in time at most exponential in the tree-width of the graph underlying the 
braid [lIlE]. 

Proof: Given a quantum circuit Q on n qubits and with s gates, whose bubble-width is poly- 
logarithmic, we perform the following steps: 

1. We create a quantum circuit Q' , of n' qubits and s' gates, such that: a) n',s' are at most 
polynomial in n,s, b) the probability that Q outputs is equal to (O'^IQ'lO'^). 

2. We create a braid b whose Jones polynomial at a particular root of unity is inverse-polynomially 
close to (O^IQIC^). The graph corresponding to b will have poly-logarithmic bubble width. 

3. We classically evaluate the Jones polynomial at the particular root of unity in quasi-polynomial 
time. 

StepIDis a standard construction in quantum computation-see [lO] or if you are really desperate, 
pages 9 — 10 in ^ (we note it is very similar to the construction from Q to Q' in section S]). It 
is simple to verify that the bubble width of Q is no more than one more than twice the bubble 
width of Q' (again, this is the same result as lemma [5TT]) . Step [2] follows from the results of [6l[T]. 
Specifically, the braid b has 4n strands, and each gate in the original circuit is replaced by poly- 
logarithmically many crossings in the braid 6, on the 4 or 8 strands corresponding to the qubit or 
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qubits involved in the gate. It is straightforward to see that the bubble-width of the underlying 
graph of the braid (the underlying graph is the graph obtained by replacing every crossing by 
a vertex) remains poly-logarithmic. Consequently, Lemma 12.51 implies that the tree-width of the 
underlying graph of this braid is poly-logarithmic as well. Step [3] follows from the known result 
im [3] which states that the Jones polynomial at any point, of a braid whose underlying graph has 
poly-logarithmic tree-width, can be calculated in time which is quasi-polynomial. 
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